# English Audio Processing
Qwen2 Audio 7B Instruct GGUF
Apache-2.0
Static quantized version of Qwen2-Audio-7B-Instruct model, supporting English audio-to-text conversion tasks
Audio-to-Text
Transformers English

Q
mradermacher
146
0
Ichigo Llama3.1 S Instruct V0.4 GGUF
Apache-2.0
A statically quantized model based on Menlo/Ichigo-llama3.1-s-instruct-v0.4, offering multiple quantization versions to suit different hardware requirements.
Large Language Model English
I
mradermacher
369
1
My Awesome Mind Model
Apache-2.0
An audio classification model fine-tuned on the minds14 dataset based on facebook/wav2vec2-base
Audio Classification
Transformers

M
faaany
1
0
Mini Ichigo Llama3.2 3B S Instruct
Apache-2.0
The Ichigo-llama3s series model is a multimodal language model developed by Homebrew Research, natively supporting audio and text input comprehension. Based on the Llama-3 architecture, it is trained using WhisperVQ as an audio file tokenizer, enhancing its audio understanding capabilities.
Text-to-Audio English
M
Menlo
22
34
Wav2vec2 Gpt2 Wandb Grid Search
Automatic Speech Recognition (ASR) model trained on the LibriSpeech dataset
Speech Recognition
Transformers

W
sanchit-gandhi
13
0
Featured Recommended AI Models